Target-Free Text-Guided Image Manipulation

نویسندگان

چکیده

We tackle the problem of target-free text-guided image manipulation, which requires one to modify input reference based on given text instruction, while no ground truth target is observed during training. To address this challenging task, we propose a Cyclic-Manipulation GAN (cManiGAN) in paper, able realize where and how edit regions interest. Specifically, editor cManiGAN learns identify complete image, cross-modal interpreter reasoner are deployed verify semantic correctness output instruction. While former utilizes factual/counterfactual description learning for authenticating semantics, latter predicts "undo" instruction provides pixel-level supervision training cManiGAN. With above operational cycle-consistency, our can be trained weakly supervised setting. conduct extensive experiments datasets CLEVR COCO datasets, effectiveness generalizability proposed method successfully verified. Project page: sites.google.com/view/wancyuanfan/projects/cmanigan.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Text-Guided Attention Model for Image Captioning

Visual attention plays an important role to understand images and demonstrates its effectiveness in generating natural language descriptions of images. On the other hand, recent studies show that language associated with an image can steer visual attention in the scene during our cognitive process. Inspired by this, we introduce a text-guided attention model for image captioning, which learns t...

متن کامل

Image-guided Precision Manipulation of Cells and Nanoparticles in Microfluidics

Title of Document: IMAGE-GUIDED PRECISION MANIPULATION OF CELLS AND NANOPARTICLES IN MICROFLUIDICS Zachary Cummins, Doctor of Philosophy, 2016 Directed By: Professor Benjamin Shapiro, Fischell Department of Bioengineering Manipulation of single cells and particles is important to biology and nanotechnology. Our electrokinetic (EK) tweezers manipulate objects in simple microfluidic devices using...

متن کامل

Manipulation in advertising text: lexical and semantic aspect

The present paper focuses on the questions of modern advertising science, structure of advertising and elements making actual manipulative influence from the addresser. Advertising encourages product sales, is an instrument of forming ethical standards, values, creating cultural values, standards and mode of behavior that is why the wide system of means for achieving aims of advertisers is need...

متن کامل

Visually guided manipulation tasks

In this paper, we present a framework for a robotic system with the ability to perform real-world manipulation tasks. The complexity of such tasks determines the precision and freedoms controlled which also affects the robustness and the flexibility of the system. The aspect is on the development of visual system and visual tracking techniques in particular. Since precise tracking and control o...

متن کامل

Post-Prostatectomy Image-Guided Radiotherapy: The Invisible Target Concept

In the era of intensity-modulated radiation therapy, image-guided radiotherapy (IGRT) appears crucial to control dose delivery and to promote dose escalation while allowing healthy tissue sparing. The place of IGRT following radical prostatectomy is poorly described in the literature. This review aims to highlight some key points on the different IGRT techniques applicable to prostatic bed radi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2023

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v37i1.25134